# High-Precision Feature Extraction
Vit So400m Patch16 Siglip 512.v2 Webli
Apache-2.0
A vision Transformer model based on SigLIP 2, designed for image feature extraction and suitable for multilingual vision-language tasks.
Text-to-Image
Transformers

V
timm
2,766
0
Aimv2 Large Patch14 224.apple Pt Dist
AIM-v2 is an image encoder based on the timm library, utilizing distillation training methods, suitable for image feature extraction tasks.
Image Classification
Transformers

A
timm
380
1
Dinov2.large.patch 14.reg 4
Apache-2.0
DINOv2 is a vision transformer-based image feature extraction model that enhances feature extraction capabilities through the introduction of register mechanisms.
D
refiners
15
0
Cvlface Adaface Vit Base Kprpe Webface12m
MIT
Face recognition model based on keypoint relative position encoding, using ViT architecture and trained on the WebFace12M dataset
Face-related
Transformers English

C
minchul
122
1
Cvlface Arcface Ir101 Webface4m
MIT
Deep face recognition model based on the ArcFace loss function, trained on the WebFace4M dataset using the IR101 architecture
Face-related
Transformers English

C
minchul
44
3
Featured Recommended AI Models